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(54) Title: HEPATIT1S-C VIRUS TESTING 

(57) Abstract 

New stvles of hepatitis-C virus 
(HCV), referred to as HCV-3 and HCV- 
4. have been identified and sequenced. 
Antigenic regions of HCV-2, HCV-3 
and HCV-4 polypeptides have been 
identified. Immunoassays for HCV and 
antibodies thereto are described, which 
allow more complete screening of blood 
samples for HCV, and allow HCV gen- 
otyping. 
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HEPATITIS-C VIRUS TESTING 

Technical Field 

The present invention relates to the discovery of new 
types of hepatitis C virus, that we have termed type 3 
(HCV-3) and type 4 (HCV-4). In particular, it relates- to 
the etiologic agent of hepatitis C virus type 3 and 4, and 
to polynucleotides and immunoreactive polypeptides which 
are useful in immunoassays for the detection of HCV-3 and 
HCV-4 in biological samples; and also to the use of 
antigenic HCV-3 and HCV-4 specific polypeptides in 
vaccines . 

Background of the Invention 

Acute viral hepatitis is a disease which may result 
in chronic liver damage. It is clinically diagnosed by a 
well-defined set of patient symptoms, including jaundice, 
hepatic tenderness, and an increase in the serum levels of 
alanine aminotransferase and aspartate aminotransferase. 
Serologic immunoassays are generally performed to diagnose 
the specific type of viral causative agent. 
Historically, patients presenting with symptoms of 
hepatitis and not otherwise infected by hepatitis A, 
hepatitis B, Epstein-Barr or cytomegalovirus were 
clinically diagnosed as having non-A, non-B hepatitis 
(NANBH) by default. 
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clone was ligated with overlapping clones to produce a 
larger viral antigen (C100) corresponding to part of the 
NS3-NS4 genomic region. C100 was then fused with the 
human superoxide dismutase (SOD) gene, expressed in use as 
a large recombinant fusion protein (C100-3) and used on 
solid phase to develop radio-labelled (RIA) and 
enzyme-linked immunosorbent assays (ELISA) . 

Polynucleotides useful for screening for HCV are 
disclosed in European Patent Specification EP-A-0398748 . 
European Patent Specification EP-A-0414475 purports to 
disclose the propagation of HCV in culture cells and the 
production of antigens for use in diagnostics. European 
Patent Specification EP-A-0445423 discloses an improved 
immunoassay for detecting HCV antibodies. 

Blood banks in the United Kingdom have recently begun 
routine testing of blood donors for antibodies to 
components of HCV. One assay involves the detection of 
HCV antibodies to C100-3 polypeptides. The C100-3 
antibody recognises a composite polyprotein antigen within 
non-structural regions of the virus and is a consistent 
marker of HCV infection. However, in acute infections 
this antibody is unreliable because of the delay 
(typically 22 weeks) in seroconversion after exposure. 
Furthermore, the C100-3 antibody test lacks specificity 
for the hepatitis C virus . 

Second generation antibody tests employ recombinant 
antigens or synthetic linear peptides representing 
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analysis. The invention has thus identified 
polynucleotide sequences and polypeptides which are HCV-3 
and HCV-4 specific. These may be used to diagnose HCV-3 . 
and HCV-4 infection and should thus be included in any 
definitive test for HCV infection. 

One aspect of the invention provides polynucleotide 
sequences unique to hepatitis C virus types 3 and 4 (HCV-3 
and HCV-4) . The sequences may be RNA or DNA sequences. 
In principal any HCV-3 or HCV-4 specific polynucleotide 
sequence from non-coding, core, El, E2 or NS 1-5. genome 
regions can be used as a hybridisation probe. The 
sequences may be recombinant (i.e. expressed in 
transformed cells) or synthetic and may be comprised 
within longer sequences if necessary. Equally, deletions, 
insertions or substitutions may also be tolerated if the 
polynucleotide may still function as a specific probe. 
Polynucleotide sequences such as core, NS3, NS4 and NS5 
which code for antigenic proteins are particularly useful. 

Another aspect provides an antigenic HCV-3 or HCV-4 
specific polypeptide, particularly from the core, NS3, NS4 
or NS5 regions (e.g. the HCV-3 or HCV-4 counterparts of 
C100 polypeptide, 5-1-1 polypeptide, C3 3 polypeptide or 
C22 polypeptide or epitopes thereof) or polypeptides 
including these antigens. 

A further aspect of the invention provides labelled 
antigenic HCV-3 or HCV-4 specific polypeptide (or mixtures 
thereof, particularly from the core and NS4 regions) for 
use in an immunoassay. 
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In particular, the invention allows blood donor 
screening by conventional assays (using HCV type 1 encoded 
antigens) to be supplemented with a second test that 
contains two oligopeptides corresponding to first and 
second antigenic regions found in the NS-4 sequence of HCV 
type 3 (positions 1691 to 1708; sequence 

KPALVPDKEVLYQQYDEM and positions 1710 to 1728; sequence 
ECSQAAPYIEQAQVIAHQF) and two derived from the equivalent 
regions of HCV type 2, R ( A/V) V (V/I) ( A/T) PDKE (I/V) LYEAFDEM 
and ECAS (K/R) AALIEEGQR (M/I ) AEML. 

The corresponding HCV-4 antigens from substantially 
positions 1691 to 1708 and 1710 to 1723 may be used for 
HCV-4 detection. 

Thus, the present invention has also identified 
corresponding polynucleotide and polypeptide sequences 
which may be used to identify hepatitis C type 2 viral 
infection. 

Production and detection of the antigen-antibody 
immune complex may be carried out by any methods currently 
known in the art. For example, a labelling system such 
as enzyme, radioisotope, fluorescent, luminescent or 
chemiluminescent labels may be employed, usually attached 
to the antigen. Labelled anti-antibody systems may also 
be used. The recombinant antigen may be either used in 
liquid phase or absorbed onto a solid substrate. 

Oligopeptides corresponding to the antigenic regions 
of all three major types may also be used separately to 
serologically distinguish individuals infected with 



WO 93/10239 



PCT/GB92/02143 



- 9 - 

non-coding region of KCV samples from 18 blood donors and 
a comparison with previously published nucleotide 
sequences (see Table 2); sequence' numbering corresponding 
to the prototype HCV-1 sequence (ref .4) and previous 
designations of type 1 or 2 being indicated; 

Figure 2 is a phylogenetic analysis showing 
clustering of the sequences into three -types viz; HCV-1, 
HCV-2 and HCV-3 for the 5' NCR results of Figure 1 using 
the maximum likelihood algorithm, shown as an unrooted 
tree. Numbers 1-18 in full circles correspond to blood 
donor sequences E-bl through E-bl8. Numbers 1 to 26 in 
open circles correspond to the previously published 
sequences identified in Table 2.; 

Figure 3 is a comparison of deduced amino acid 
sequences in the NS-5 region of blood donors (E-bl, E-b2, 
E-b3, E-b7 (type 3) and E-bl2 (type 2) with those 
previously published (Table 2) . Amino acid residue 
numbering follows that of the HCV-1 polyprotein ( 4 ) and 
uses single letter amino acid codes; 

Figure 4 is a phylogenetic . analysis of the NS-5" 
region using the maximum likelihood algorithm, shown as an 
unrooted tree. Symbols are as described for Figure 2; 

Figure 5 is a comparison of deduced amino acid 
sequences in the NS-3 region of blood donors (E-bl, E-b2, 
E-b6, E-b7 (type 3) with those previously published 
(Table 2). Group 1/1: amino acid sequence of fl, f 3 , f4, 
f5, h2 , h3 , h4 (one), i2, i3 , i4 , pi, p2 ; Group 1/2: 
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cysteine; Q: glutamine; E: glutamic acid; G: glycine; H: 
histidine; I: isoleucine; L: leucine; K: lysine; M: 
methionine; F: phenylalanine; P: proline; S: serine.;- T: 
threonine; W: trytophan; Y: tyrosine; V: valine."-.": 
sequence nor determined; difference from consensus shown 
in bold . 

Figure 10(a) shows a comparison of amino acid 
sequences between residues 1679 and 1768 (Choo et al . , 
1991) of the three major variants of HCV. T16, T42, T77, 
T1801, T1825: Scottish blood donors infected with HCV 
type l; T3 51: Scottish blood donor infected with HCV type 
2; T59, T940, T810: Scottish blood donors infected with 
HCV type 2; T40, T33, T36, T26, T1787: Scottish blocd 
donors infected with HCV type '3; and Figure 10(b) shows - 
the derivation of consensus sequences for HCV types 3, 2 
and 1 oligopeptide series. Differences. from consensus 
shown in bold, Amino acid codes: A: alanine; R: 
arginine; N: asparagine; D: aspartic acid; C: cysteine; 
Q: glutamine; E: glutamic acid; G: glycine; H: histidine; 
I : isoleucine ; L: leucine; K: lysine; M: methionine; F: 
phenylalanine; P: proline; S: serine; T: threonine; W: 
tryptophan; Y: tyrosine; V: valine; " . " : not determined. 

Figures 11(a) to 11(c) show amino acid sequences of 
nonameric oligopeptides used for epitope mapping, derived 
from consensus HCV type 3 , type 2 and type 1 sequences 
respectively. Amino acid codes: A: alanine; R: arginine; 
N: asparagine; D: aspartic acid; C: cysteine; Q: 
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For clarify, only non-identical sequences are shown in 
tree; e.g. Sequence 1 corresponds to those found in 
samples Eg-1,6 and Eg-29 etc. (Figure 1) . Hollow squares 
are published sequences from Zaire; Hollow small circles 
are sequences from South Africa; Hollow small solid 
circles are sequences obtained elsewhere in the world. 

Figure 15 A/B is a comparison of nucleotide (A) and 
amino acid (B) sequences in the core region. Symbols as 
for Figure 1-3. Single letter amino acid codes are used; 

Figure 16 is a phylogenetic analysis of part of the 
core region using the maximum likelihood algorithm, shown 
as an unrooted tree. Sequences are numbered as in Figure 
14; sequence 3 0 is that of HC-J8 (Okamato et al. Virology 
188:331 -341).; and 

Figure 17 shows cleavage patterns f.or A) Haelll/Rsal 
and B) ScrFI in 5 'NCR. 

I) ANALYSIS OF HEPATITIS C VIRUS AND 
PHYLOGENETIC RELATIONSHIP OF TYPES 1, 2 and 3 
Introduction 

Sequence analysis of : the 5' non coding region of 
hapatitis C virus (HCV) amplified from the plasma of 
individuals infected in Britain revealed the existence of 
three distinct groups of HCV, differing by 9-14% in 
nucleotide sequence. Two of the groups identified were 
similar to those of HCV variants previously termed type 1 
and type 2, while the third group appeared to represent a 
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ssibility that mismatches between the primers and the 
riant sequence Will prevent amplification. -We have used 
several strategies to overcome this problem. For initial 
virus detection, we used primers in the 5 'NCR, which are 
reported to highly conserved amongst type 1 variants (4, 
11, 13, 16, 23, 24, 26, 33), and between Kl and K2 (23). 
Sequence analysis of the blood donors allowed the 
identification of type 1 and type 2 variants by comparison 
with. published sequence data. This- analysis .also revealed . 
the existence of a third "type" . of HCV that appeared to be 
as distinct from type 1 as type 2 was (Figs. 1,2; Table 3). 
Based on our initial tentative classification, we sought 
corroboration of our findings in other (coding) and more 
variable regions of the viral genome. . •; .' • % , 

Analysis of the NS-5" region, which was based on ' ■ ■ • 

several sequences of each of the three types (Figs.- 3, 4; 
Table 3), confirmed the existence of 3 major groups, with 
type 3 sequences forming a relatively homogeneous group 
that was quite distinct from types 1 and 2 . The proposed 
separation of type 1 sequences into PT and Kl "sub-types" 
and type 2 sequences into K2a and K2b is supported by this 
analysis, in which the single type 2 blood donor sequence 
obtained in this study appears most similar to K2b . 
Differentiation of HCV type 1 sequences into two groups is 
also clearly shown in tne core (FigTT) and NS-3 regions ... 
(Fig. 5), in both cases with the type 3 sequences appearing 
considerably more distant. 
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reading frame encoding a poiyprotein that is subsequently 
cleaved into structural and non-structural proteins. Weak 
sequence homologies have been detected with several other 
virus groups that have positive-sense RNA genomes 
(19,21). Although the overall degree of sequence 
dissimilarity between types 1, 2 and 3 cannot be measured 
by comparison of the small regions of sequence analysed in 
this study, a rough estimate of the extent of divergence 
in protein coding regions is given by an examination of 
the divergence of the partial core sequence. This shows 
that the difference between HCV type 1 and type 3 core 
region (approximately 10% amino acid sequence divergence) 
is comparable to that which exists between different 
serotypes of the flavivirus, tick-borne encephalitis virus 
(14%; ref .20) , but lower than that which is found between 
serotypes of a mosquito borne flavivirus, dengue fever 
virus (33%) , and the West Nile (WN) subgroup (28-43% 
divergence). The 5 'NCR sequences of. the different members 
of WN subgroup are also considerably more diverse than 
those of the three types of HCV (=50% similarity; ref. 5), 
although within each of the members e.g. Murray Valley 
encephalitis virus, the 5 'NCR is extremely well conserved 
(>95%- similarity; ref-,5) . On the basis of these 
analogies, we speculate that the major types of HCV 
represent distinct "serotypes", each capable of human 
infection irrespective of the immune response mounted 
against other HCV types. 
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transcriptase (Promega) in 20ul buffer containing 50 mM 
Tris-HCl (pH 8.0), 5mM MgCl 2 , 5 mM dithiothreitol , 50 mM 
KC1, 0.05 ug/ul BSA, 15% DMSO, 600 uM each of dATP, dCTP, 
dGTP and TTP , 1.5 uM primer and 10 U RNAsin (Promega). 

PCR was performed from 1 ul of the cDNA over 2 5 
cycles with each consisting of 25 sec. at 94°C, 35 sec. at 
50°C and 2.5 min. at 68°C. The extension time for the 
last cycle was increased to 9.5 min. The reactions were 
carried out with 0.4 unit Taq polymerase (Northumbria 
Biologicals Ltd.) in 20 ul buffer containing 10 mM 
Tris-HCl, pH 8.8, 50 mM KC1, 1 . 5 mM MgCl 2 , 0.1% Triton 
X-100, 3 3 uM each. of dATP , dCTP, dGTP and dTTP and 0.5 uM 
of each of the outer nested primers. One ul of the 
reaction mixture was then transferred to a second tube 
containing the same medium but with the inner pair of 
nested primers, and a further 25 heat cycles were carried 
out with the same programme. The PCR products were 
electrophoresed in 3% low melting point agarose gel (131) 
and the fragments were detected by ethidium bromide 
staining and UV illumination. For sequence analysis, 
single molecules of cDNA were obtained at a suitable 
limiting dilution at which a Poisson distribution of 
positive and negative results was obtained (30). 

Direct Sequencing of PCR Products . The PCR products were 
purified by glass-milk extraction ( "GeneClean" ; BiolOl, 
Inc.). One quarter of the purified products was used in 
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To establish the interrelationships of the major 
types of HCV, we have separately analysed several regions 
of the viral genome that differ in sequence variability 
and evolutionary constraint. Thus the conclusions drawn 
from the sequence comparisons are not subject to spurious 
evolutionary phenomena that may affect a particular 
region. However, one problem with the analysis presented 
here was the absence of a viral sequence that was 
sufficiently distantly related to HCV to serve as an 
out-group. Thus, although we describe the 

interrelationships of different sequence variants of HCV, 
it should be stressed that we have no means of deciding 
which sequence is ancestral to the others. The trees are 
thus drawn in the less familiar un-rooted form to indicate 
this . 

RESULTS 

1] AnalV5iS ° f th e 5/ — - samples were 

obtained from 18 blood donors that were repeatedly 
reactive in the Abbott 2nd Generation enzyme immunoassay 
and which were confirmed or indeterminate in the Chiron 
4-RIBA (E-bl through E-bl8 , ref.10). HCV sequences 
present in stored plasma samples from each donor were 
amplified with primers corresponding to sites in the 5 'NCR 
(12,25) that are well conserved between all known HCV type 
1 and type 2 variants (4,11,13,16,23,24,26,33). 
Sequencing of the PCR product, after limiting dilution to 
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2) Analysis of the NS-5 Region . The nucleotide sequence 
of the NS-5 region has been found to vary significantly 
between the previously described Kl and K2 variants of HCV 
(7). To investigate whether type 3 sequences were equally 
distant from the other two types in this region as well as 
in the 5 'NCR, we compared sequences from four type 3 blood 
donors (E-bl, E-b2, E-b3 and E-b7) and one type 2 donor 
(E-bl2) with previously published sequences (Fig. 3; 
Fig. 4 ; table 3) . 

A remarkable variation was observed between sequences 
of the three types in this region. Again, type 3 
sequences form a separate group from type 1 and type 2 in 
this region. However, unlike the 5 'NCR, there appear to 
be subdivisions within the type 1 and type 2 groups. Type 
1 sequences are split between those found in Japanese 
infected individuals (e.g. HCV- J ; HCV-BK; sequence numbers 
12,- 13, 16-20 in table 2) and those of U.S.A. origin 
(HCV-l, Pt-1, H77, H90; sequence numbers 1-4; fig. 4). 
There is also some evidence for a split between type 2 
sequences, those corresponding to their previous 
designation as K2a (7) appearing distinct from type K2b 
sequences and the Scottish blood donor, E-bl2. 

Table 3 shows that the average nucleotide distances 
between the two groups of HCV type 1 sequences is 25% 
(indicated here as type la [USA] and type lb [Japanese]), 
with variation of only 4-7% within each group. The 
nucleotide sequence divergence within the two type 1 
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These sequences are representative of the range of 
variation found in this region in individuals infected in 
Britain; comparison of the tree previously published (31) 
with fig. 6 shows that the former forms a very small 
component of the overall tree obtained once Japanese type 
1 and type 3 sequences are added. 

The maximum likelihood tree shows that type 1 and 
type 3 have diverged considerably from each other. As was 
found in the NS-5 region, subtypes of type 1 sequences are 
found in NS-3 . Again, sequences of Japanese origin 
(HCV-J, HCV-BK and JH) are distinct from the prototype 
(PT) sequence, and those found in Scottish blood donors 
(E-bl6, E-bl7, pl-3), IVDUs (il-5) and haemophiliacs 
(hl-5) , all of which correspond to the prototype sequence 
(fig. 5). However, the average subtype difference (23%) is 
lower than those that exist between HCV-1 and HCV-J with 
the four type 3 sequences (37-43%). As reported 
previously (31), the majority of nucleotide substitutions 
that exist between type 1 sequences are silent (i.e. do not 
affect the encoded amino acid sequence) , while numerous 
amino acid substitutions exist between type 1 and type 3 
sequences (fig. 5). The analysis of the NS-3 region 
includes the sequence of clone A (35) which was obtained 
from Japanese patients with NANB hepatitis, and which was 
reported to be distinct from- existing HCV type 1 
sequences. In Fig. 6, this sequence appears to be 
distinct from both HCV type 1 and type 3, with corrected 
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TABLE 1 

SEQUENCES AND SOURCES OF PRIMERS USED FOR AMPLIFICATION OF HCV GENOME. 



Name Region Position Sense' Sequences 5* -3' R£I - 

ot 5'basc' 

209 5'NCR 8 - AT ACTCG AG GTGC ACGGTCT ACG AG ACCT (12) 

211 5'NCR -29 - CACTCTCGAGCACCCTATCAGGCAGT (12) 

939 5'NCR -297 + CTGTGAGG AACTAC IGILl 1 (25) 

940 5'NCR -279 + TTCACGCAGAAAGCGTCTAG (25) 



410 CORE 410 - ATGTACCCCATGAGGTCGGC 

-406 CORE -21 + AGGTCTCGTAGACCGTGCATCATGAGCAC 



:288 NS-3 4951 - CCGGC ATGC ATGTC ATG ATGTAT (31) 

290 NS-3 4932 - GT ATTTGGTG A CTG GGTGC GTC (3D 

208 NS-3 4662 - TCTTGAA I 1 i 1 GGG AGGGCGTCTT 

207 NS-3 4699 + CATATAGATGCCCACTTCCTATC 



007 NS-4 5293 - AACrCGAGTATCCCACTGATGAAGTTCCACAT 
.220 NS-4 5278 - CACATGTGCTTCGCCCAGAA 

HCV rvpe J.i 

221 NS-4 4858 GG ACCT ACGCCCCTTCT ATA 

008 NS-4 4878 + TCGGTTGGGGCCTGTCCAAAATG 

HCV rvpe 2: 

281 NS-4 4858 + GGTCCCACCCCTCTCCTGTA 

509 NS-4 4878 -r CCGCTTGGGTTCCGTTACCAACG 

HCV rvpe 1: 

253 NS-4 4858 - GGGCCAACACCCCTGCTATA 

196 NS-4 4878 - CAGACTGGGCGCCGTTCAGAATG 
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TABLE 2 

SOURCE AND CITATION OF PREVIOUSLY PUBLISHED HCV SEQUENCES USED IN THIS STUDY 



Geographical 

No. Type Abbreviation Source Reference Rsf. No. 



I 1 
L I 


nL v - i 




f^Vir\r\ or nl 1 QQ 1 

Lnoo ci ai.. i77 i 






Pr I 


Japan 


iNaXHO €1 UL.. I77I 










enomoto ut.» xyvu 


W ) 


J. H 1 


ri / / . M7U 




Aoori « r n I 1 QQ 1 

\J)laiux SI ui.. 177 1 


\-*+) 


J. O 1 




kjermany 


Pm~fic at rt} 1 QQ 1 

rue no ei cil 177 i 


MM 


7 1 


T 1 
J 1 


Japan 


naii ct fli.. i77i 




8 1 


Al 


Australia 


Han et aL. 1991 


(13) 


9 1 


SI 


S. Africa 


Han et aL. 1991 


(13) 


10 I 


Tl 


Taiwan 


Han et aL. 1991 


(13) 


11 1 


L' 18/124 


U.S.A/Italy 


Han c-r aL. 1991 


(13) 


12 1 


HCV-J 


Japan 


Kato^r a/.. 1990 


(16) 


13 1 


HCV-BK 


Japan 


Takamizawa et aL. 1991 


(33) 


14.15 1 


HC-J1. -J4 


Japan 


Okamoto et aL. 1990 


(26) 


16-20 1 


KL Kl- 1-4 


Japan 


Enomoto et aL. 1990 


(7) 


21 1 


JH 


Japan 


Kubo *r ai.. 1990 


(17) 


22 1 


J7 


Japan 


Takeuchi et aL. 1990 


(34) 


23-26 2 


K2a. K2a-1. 


Japan 


Nakao et aL. 1991 


(23) 




K2b. K2b-l 




Enomoto er aL. 1990 


(7) 


27 ? 


Clone A 


Japan 


Tsukiyama-fCohara. 1991 


(35) 
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II) . SEROLOGICAL REACTIVITY OF BLOOD DONORS INFECTED WITH 

THREE DIFFERENT TYPES OF HEPATITIS C VIRUS. 

HCV sequences were amplified in the 5 'non-coding 
region (5 'NCR) , core, NS-3 and NS-5 regions from blood 
donors, haemophiliacs and intravenous drug abusers. 

Blood donations that were repeatedly reactive on 
screening with Abbott 2nd generation enzyme immunoassay 
(EIA) and positive or indeterminate by Ortho recombinant 
immunoblot assay (RIBA) were amplified by primers in the 
5'NCR (reference 10). The first fourteen PCR-positive 
blood donations (where PCR was used to amplify and thus 
detect HCV RNA present in the blood) were then typed by 
sequence analysis of the amplified region, and compared 
with their serological reactivity to a range of structural 
and non-structural peptides in two 1st generation EIAs 
(Ortho HCV ELISA; Abbott HCV EIA) and two RIBA assays 
(Ortho RIBA and Innogenetics LIA; Table 4) . The five 
donations containing HCV type 1 sequences were positive in 
both EIAs, reacted with all antigens in the Ortho RIBA 
assay, and were broadly reactive in the LIA. However, all 
but two of the sera from donors with type 2 and 3 
infections were completely negative on anti-ClOO EIA 
screening and failed to react with 5-1-1, C100 (RIBA) and 
NS4 (LIA) . 
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TABLE 4 



SEROLOGICAL REACTIVITY OF SERA FROM BLOOD DONORS 
INFECTED WITH THREE TYPES OF HEPATITIS C VIRUS 



Donor 


HCV 


anci - 


Or 


tino RXBA 




Xnnogc 


»na 


ties 


LXA 




Number 


genotype 


C100 




















0* 


At 


5-1-1 


C1Q0 C33 


C22 


KS4 


NS5 


CI 


t C2 


C3 


C4 


B-bl3 






> 


35 


4 4 


4 


25 


3 


1 


2 


1 


1 


B-blS 








4 


4 4 


4 


2 


3 


3 


2 


2 


1 


B-bl6 


1 




> 


4 


4 4 


4 


2 


3 


2 


3 


3 




B-bl7 








4 


4 4 


4 


3 


3 


3 


2 


1 


1 


B-bl8 








4 


4 4 


4 


3 




2 


1 


1 




B-b9 










1 3 


4 






3 
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*Ortho HCV ELISA (Recombinant CI 00-3) 

f Abbott HCV EIA (Hepatitis C Recombinant DNA Antiecn) 

iCone oligopeptides. L-4 

§Bands scored - (negative) to 4 (strong positive) according to manufacturers instructions. 
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with primers corresponding to sense and anti-sense 
sequences spanning the antigenic region of NS-4 (table 
1) . Nucleotide sequences obtained from the amplified DNA 
were compared and used to define consensus sequences for 
each HCV type. In-frame translation of the nucleotide 
sequences yielded an uninterrupted consensus amino acid 
sequence that was used to define a series of overlapping 
oligopeptides for epitope mapping. 

Eoitooe mapping and determination of antibody specificities 

Overlapping synthetic peptides were synthesised on 
polypropylene pins using ki is commercially available from 
Cambridge Research Biochemicals Ltd. The principle of the 
addition reactions is described in refs (Geysen et al . , 
1984; Geysen et al., 1985). Antibody reactions were 
carried out on pins disrupted by sonication (30 minutes) 
in 1% sodium dodecyl sulphate, 0.1% 2-mercaptoethanol , 
0 . 1M sodium dihydrogen orthophosphate . Pins were 
pre-coated in 1% ovalbumin, 1% bovine serum albumin, 0.1% 
Tween-20 in phosphate buffered saline (PBS) for one hour 
at room temperature. Serum or plasma was diluted 1.40 in 
PBS + 0.1% Tween-20 (PBST) and incubated with the blocked 
pins at 4°C for 13 hours. After washing in 4 changes of 
PBST (10 minutes at room temperature, with agitation), 
bound antibody was detected by incubation in a 1/20000 
dilution of affinity isolated anti-human IgG, peroxidase 
conjugate (Sigma) for one hour at room temperature. 



WO 93/10239 



PCT/CB92/021 



- 37 - 

considerably from the amino acid sequences of other blood 
donors infected with HCV types 1 and 2 (T16, 42, 77 , 1801, 
1825, 351, 940 and 810; Figure 6a). Sequence comparison 
between the major HCV types from residues 1679 to 1769 
reveals three regions of considerable amino acid sequence 
variability. Most of the observed differences between 
types involve non-synonymous amino acid substitutions, 
particularly alternation of acidic and basic residues in 
the hydrophillic regions. These changes would be expected 
to profoundly alter the overall conformation of the 
protein, and its antigenicity. 

The consensus amino acid sequences in this region of 
types 1-3 (Figure 6b) were used to define three series of 
82 nonameric oligopeptides overlapping by eight of the 
nine residues with' those before and after in the series 
(Figure 7a-c) . These were synthesised on a 12 x 8 arrays 
of polypropylene pins as described in Methods. Antibody 
reactivity to the immobilised .antigens on' the pins was 
determined by indirect ELISA, using an overnight 
incubation with a 1/40 dilution of test serum overnight at 
4°C, followed by washing, and detection with an anti-human 
IgG-peroxidase conjugate and appropriate substrate (see 
Methods) . 

Reactivity of an anti-HCV negative, PCR-negative 
donor, with no known risk factors for HCV infection with 
the three series of peptides was determined. No 
significant reactivity is shown with any of the 
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exceeds that found in the core, NS-3 or NS-5 regions 
previously analysed (Chan et al, 1992). The function of 
the protein encoded by this region of the HCV genome is 
unknown, and the consequences of this variability on virus 
replication and pathogenesis are unknown. The function of 
the NS-4 region in flaviviruses and pestiviruses is also 
poorly defined. 

The degree of amino acid sequence variability, and 
the nature of the amino acid substitutions indicate that 
the major sites of antibody reactivity are also those of 
antigenic variability. This undoubtedly underlies the 
restricted cross-reactivity of HCV type 1 NS-4 encoded 
antigens with sera from individuals infected with 
differenr HCV types. Serological diagnosis of infection 
is currently based entirely on recombinant or synthetic 
oligopeptide sequences derived ultimately from HCV type 1 
sequences (Choo et al., 1991). The serological response 
to infection is often very restricted in its initial 
stages, with antibody to only one of the recombinant 
antigens used for screening. Not only does this present 
difficulties with supplementary antibody tests, where 
reactivity to two HCV-encoded antigens is required for 
confirmation, but can lead to an increased probability of 
failing to detect early infection with HCV types 2 and 3. 

Table 7 relates HCV typing determined by PCR to 
results obtained using type-specific antigens (TSA) and 
shows good correlation for HCV1-3 types. 
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TABLE 6 

SEQUENCES OF OLIGONUCLEOTIDES SUITABLE FOR DIRECT DETECTION OF 
HCV TYPE 3 IN CLINICAL SPECIMENS BY POLYMERASE CHAIN REACTION 



Name Resion 



Position 
of 5'base" 



Pol.t 



Sequences 5'-3'i 



007 
220 



NS-4 
NS-4 



5293 
5278 



AACTCGAGTATCCCACTGATGAAGTTCCACAT 
C ACATGTGCTTCGCCC AG AA 



Type 5i 
TS-3a NS-4 
TS-3b NS-4 



5140 
5161 



GCCGCCCCATATATCGAACA 
GCTCAGGTAATAGCCCACCA 



Type 2: 
TS-2a NS-4 
TS-2b NS-4 



5140 
5161 



AAAGCCGCCCTCATTGAGGA 
GGGCAGCGGATGGCGGAGAT 



Type 1: 
TS-la NS^ 
TS-lb NS-4 



5140 
5161 



CACTTACCGTACATCGAGCA 
GGGATGATGCTCGCCGAGCA 



* Position of 5'base relative to HCV genomic sequence in Choo et al.. (1991). 

t Oricnrarion of primer sequence (+: sense: -: anti-sense) 

± Abbreviations: A: adenine. C: cyridine: G: euanidine. T: thymidine. 

1 Type-specific sense primers for amplification of HCV types 3. 2 and 1 variants. 
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PART IV IDENTIFICATION OF HCV TYPE -4 

Introduction 

Investigations were carried out on sequence 
variations in the 5' non-coding region (5 'NCR) of HCV 
samples froa a variety of worldwide geographical locations 
(Figure 13), and also in the core region (Figures ISA and 
15B) . Phylogenetic analysis (Figures 14 and 16) revealed 
a new distinct HCV type which we refer to herein as HCV-4 . 

Methods 

Samples . RNA was extracted from plasma samples that were 
repeatedly reactive on second generation screening assays 
for HCV, and which were either confirmed (significant 
reactivity with two or more antigens in the Chiron 
recombinant immunoblot assay; Chiron Corporation, 
Emeryville, California, USA) or indeterminate (reactive 
with only one antigen) from blood donors and patients with 
NANBH. Most of the samples containing sequences that 
differed substantially from known HCV types came from 
Egypt (EG 1-33). Others came from Holland (NL-26) , Hong 
Kong (HK 1-4), Iraq (IQ-43) and XX (xx-(6). 

Sequence determination . HCV sequences were reverse 
transcribed and amplified with primers matching conserved 
regions in the 5'NCR as previously described [1]. For 
analysis of the core region, RNA was reverse transcribed 
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Results 

Divergent: 5 'NCR sequences . Several sequences in the 5'NC 
region detected in samples of blood donors from Saudi 
Arabia, Holland and Hong Kong, and from NANBH patients in 
Iraq and xxx differed substantially from those found in 
Scottish blood donors and those reported elsewhere (Figure 
13), Instead of showing the well characterised nucleotide 
substitutions that distinguish HCV types 1, 2 and 3 from 
each other, a new set of sequence differences were 
observed in the new variants that appeared to place them 
outside the existing system of virus classification. This 
can be more simply represented by reconstructing a 
phylogeny of the sequences and presenting the results as 
an evolutionary tree (Figure 14) . This analysis confirms 
that sequences 1-10 cluster separately from the variants 
previously typed as 1,2 and 3. For convenience we will 
refer to sequences within this new group as type 
HCV type 4 . Mean distances within type 4 and between type 
4 and the other HCV types in the 5 'NCR were comparable to 
those previously described for type 1-3. Although 
sequences within type 4 are relatively closely grouped, 
sequences 11, 12 and 13 differ considerably from any of 
the known types . 

Using this phylogenetic tree, it can be seen that the 
majority of previously published 5 'NCR sequences can be 
readily identified as types 1, 2 or 3 . Furthermore, 
almost all of the sequences from Zaire (shown as hollow 
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PART V HCV TYPING 
Introduction 

In view of the sequence variations between HCV types 
1,2,3 and 4 differences in restriction enzyme cleavage sites 
exist, leading to different endonuclease cleavage patterns. 
This technique was used to identify HCV genotypes in blood 
samples from a variety of sources worldwide. 

fA) Typing of HCV1-3 
METHODS 

Serum Samples : Samples from blood donors in six countries, 
Scotland, Finland, Netherlands, Hong Kong and Australia and 
Japan, were available from routine 2nd Generation anti-HCV 
ELISA screening (Ortho or Abbott) . Donor samples that were 
repeatedly reactive in the above tests were further 
investigated using a supplementary test (Oirtho RIBA: 
Finland, Netherlands, Australia, Egypt, Abbott Matrix: Hong 
Kong) or samples were titred for anti-HCV by ELISA (Japan) . 
Samples that were positive (significant reactivity with two 
or more HCV antigens (1+ to 4+) or indeterminate (reactivity 
with one antigen only) in the RIBA test or had a titre of > 
X 4096 by ELISA (Japan only) were tested for viral RNA by 
Polymerase Chain Reaction (PCR) . 

RNA PGR : PCR for the detection of HCV RNA was carried out as 
previously described by Chan et al (reference la) using 
primers in the 5 7 non-coding region (5 'NCR) in a nested PCR, 
with primers 209/939 and 211/940(5) in first and second 
reactions respectively. 
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TABLE S 



PREVALENCE OF HCV TYPES IN DIFFERENT COUNTRIES 



COUNTRY 



HCV TYPES (%) 
HCV-1 HCV -2 HCV- 3 



Scotland 



86(51%) 21(13%) 60(36%) 



Finland 



3 (25%) 



5(42%) 



4(33%) 



Netherlands 



18 (60%) 



7(23%) 



5(17%) 



Hong Kong 



22 (63%) 



0 (0%) 



0(0%) 



Australia 



13 (57%) 



3 (13%) 



7 (30%) 



JaDan 



31 (77%) 



9 (23%) 



0(0%) 



Egypt 



0(0%) 



0 (0%) 



0(0%) 
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PCR and typing . RNA was reverse transcribed with primer 
94 0 and cDNA amplified in a two stage nested PCR reaction 
with primers 940/939, followed by 209/211 as previously 
described (Chan et al. 1992). PCR product was 
radiolabelled with [ 35 S]-dATP analysed by restriction 
endonuclease cleavage (McOmish et al. Transfusion, 
32: no. 11 1992). Samples were cleaved with ScrFI and a 
combination of Haelll/Rsal in two separate reactions to 
identify HCV types 1/4, 2, 3. Figure 17 shows 
endonuclease cleavage patterns. HCV types 1 and 4 were 
differentiated by a third reaction with Hinfl (see 
Results) . Two samples yielded restriction patterns that 
were different from those of the four known types of HCV 
and were analysed further by direct sequence analysis of 
the amplified DNA (Chan et al . 1992). These two samples 
contained 5 'NCR sequences distinct from those of known HCV 
types and currently remain unclassified. 

RESULTS 

Modification of RFLP method to identify HCV type 4. 
Previous sequence analysis in the 5 'NCR of HCV amplified 
from plasma of Egyptian blood donors revealed a relatively 
homogeneous group of novel sequence variants in both the 
5 'NCR and core region which were as distinct from HCV 
types 1, 2 and 3 as these latter types were from each 
other (see previous submission) . This new group was 
designated as HCV type 4. 
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initially carried out with Haelll/Rsal and ScrFl, and*" 
allowed the identification of 10 type 2 and 10 type 3 
variants (Table 10) . Samples showing electrophoretic 
patterns aA/3 or bA/B were further analysed by cleavage 
with hinf I , yielding 3 3 samples with pattern a, thus 
identified as type 1, 22 with pattern b and 2 with pattern 
c, both identified as type 4. Finally, two samples showed 
the unusual cleavage patterns h and i with Haelll/Ksal and 
pattern b with Hinf I, and were therefore directly 
sequenced. These two sequences were similar to each other 
but were unlike any of the known HCV types , and also 
distinct from EG-28, the other sequence showing pattern i 
with Haelll/Rsal (Table 10) . As they cannot be currently 
classified, they will be referred to as type U. 
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TABLE 10 

mENTTFICATTON OF HCV TYPES 1-4 ,N STUDY SUBJECTS BY RFLP 
ANALYSIS OF 5* NCR SEQUENCES WITH RsaUHaeUL ScrFl AND Hinfl 



Observed cleavage pattern 
HaeTH 

ScrFl 



Hinfl 



Inferred HCV type 
12 3 4 



a 
b 


A/B 
A/B 


a 
a 


2 
36 


- 


- 




- 


a 
a 
b 


A/B 
A/B 
A/B 


b 
c 
b 








16 
2 

6 




c 
d 


D 
E 


n.d. 
n.d. 




7 
3 








f 

a 
e 


G 
G 


n.d 
n.d. 






7 
3 






h 
i 


A/B 
A/B 


b 
b 

TOTALS 


38 


10 


10 


24 


1 
1 

2 



• Two samoies yielded unusual restriction patterns with HaeUlfRsal (h. il S^ucnce analysis 
of the 5'NCR placed them outside existing HCV classification tsampies IQ-48. kG-,6). 
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as bacterial, yeast, mammalian and insect cells. 
Particular examples of such cells are £ . coli , 
S . cerevisiae , P. pastoris , Chinese hamster ovary and mouse 
cells, and Soodootera f rucriperda and Tricoplusia ni . The 
choice of host cell may depend on a number of factors but, 
if post-translational modification of the HCV viral 
peptide is important, then an eukaryotic host would be 
preferred . 

The present invention also provides a process for 
preparing a peptide as defined herein which comprises 
isolating the DNA sequence, as herein defined, from the 
HCV genome, or synthesising DNA sequence encoding the 
peptides as defined herein, or generating a DNA sequence 
encoding the peptide, inserting the DNA sequence into an 
expression vector such that it is capable, in an 
appropriate host, of being expressed; transforming host 
cells with the expression vector, culturing the 
transformed host cells, and isolating the peptide. 

The DNA sequence encoding the peptide may be 
synthesised using standard procedures (Gait, 
Oligonucleotide Synthesis: A Practical Approach , 1984, 
Oxford, IRL Press) . 

The desired DNA sequence obtained as described above 
may be inserted into an expression vector using known and 
standard techniques. The expression vector is normally 
cut using restriction enzymes and the DNA sequence 
inserted using blunt-end or staggered-end ligation- The 
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response, into a mammalian host, such as a mouse, rat, 
sheep or rabbit, and recovering the antibody thus 
produced. The peptide is generally administered in the 
form of an injectable formulation in which the peptide is 
admixed with a physiologically acceptable diluent. 
Adjuvants, such as Freund's complete adjuvant (FCA) or 
Freund's incomplete adjuvant (FIA) , may be included in the 
formulation. The formulation is normally injected into 
the host over a suitable period of time, plasma samples 
being taken at appropriate intervals for assay for 
anti-HCV viral antibody. When an appropriate level of 
activity is obtained, the host is bled. Antibody is then 
extracted and purified from the blood plasma using 
standard procedures, for example, by protein A or 
ion-exchange chromatography . 

Monoclonal antibody against a peptide of the present 
invention may be obtained by fusing cells of an 
immortalising cell line with cells which produce antibody 
against the viral or topographically related peptide, and 
culturing the fused immortalised cell line. Typically, a 
non-human mammalian host, such as a mouse or rat, is 
inoculated with the peptide. After sufficient time has 
elapsed for the host to mount an antibody response, 
antibody producing cells, such as the splenocytes, are 
removed. Cells of an immortalising cell line, such as a 
mouse or rat myeloma cell line, are fused with the 
antibody producing cells and the resulting fusions 
screened to identify a cell line, such as a hybridoma, 
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infection, or the response to a particular viral antigen 
may be associated with clearance of the virus. Thus the 
exact approach adopted for the diagnosis of a viral 
infection depends upon the particular circumstances and 
the information sought. In the case of HCV, a diagnostic 
assay may embody any one of these three approaches. 

In an assay for the diagnosis of HCV involving 
detection of viral nucleic acid, the method may comprise 
hybridising viral RNA present in a test sample, or cDNA 
synthesised from such viral RNA, with a DNA sequence 
corresponding to the nucleotide sequences of the present 
invention or encoding a peptide of the invention, and 
screening the resulting nucleic acid hybrids to identify 
any HCV viral nucleic acid. The application of this 
method is usually restricted to a test sample of an 
appropriate tissue, such as a liver biopsy, in which the 
viral RNA is likely to be present at a high level. The 
DNA sequence corresponding to a nucleotide sequence of the 
present invention or encoding a peptide of the invention 
may take the form of an oligonucleotide or a cDNA sequence 
optionally contained within a plasmid. Screening of the 
nucleic acid hybrids is preferably carried out by using a 
labelled DNA sequence. Preferably the peptide of the 
present invention is part of an oligonucleotide wherein 
the label is situated at a sufficient distance from the 
peptide so that binding of the peptide to the viral 
nucleic acid is not interfered with by virtue of the label 



WO 93/10239 



PCT/GB92/02143 



- 63 - 

taken from any of the appropriate tissues and 
physiological fluids mentioned above for the detection of 
viral nucleic acid. If a physiological fluid is obtained, 
it may optionally be concentrated for any viral antigen or 
antibody present. 

A variety of assay formats may be employed. The 
peptide can be used to capture selectively antibody 
against HCV from solution, to label selectively the 
antibody already captured, or both to capture and label 
the antibody. In addition, the peptide may be used in a 
variety of homogeneous assay formats in which the antibody 
reactive with the peptide is detected in solution with no 
separation of phases. 

The types of assay in which the peptide is used to 
capture antibody from solution involve immobilization of 
the peptide on to a solid surface. This surface should be 
capable of being washed in some way. Examples of suitable 
surfaces include polymers or various types (moulded into 
microtitre wells; beads; dipsticks of various types; 
aspiration tips; electrodes; and optical devices), 
particles (for example latex; stabilized red blood cells; 
bacterial or fungal cells; spores; gold or other metallic 
or metal-containing sols; and proteinaceous colloids) with 
the usual size of the particle being from 0.02 to 5 
microns, membranes (for example of nitrocellulose; paper; 
cellulose acetate; and high porosity/high surface area 
membranes of an organic or inorganic material) . 
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may be achieved by use of labelled molecule or particle as 
described above which will react: with the captured 
antibody (for example protein A or protein G and the like; 
anti-species or anti-immunoglobul in-sub-type ; rheumatoid 
factor; or antibody to the peptide, used in a competitive 
or blocking fashion) , or any molecule containing an 
epitope contained in the peptide. 

The detectable signal may be optical or radioactive 
or physico-chemical and may be provided directly by 
labelling the molecule or particle with, for example, a 
dye, radiolabel, electroact ive species, magnetically 
resonant species or fluorophore, or indirectly by 
labelling the molecule or particle with an enzyme itself 
capable of giving rise to a measurable change of any 
sort. Alternatively the detectable signal may be. obtained 
using, for example, agglutination, or through a 
diffraction or birefringent effect if the surface is in 
the form of particles. 

Assays in which a peptide itself is used to label an 
already captured antibody require some form of labelling 
of the peptide which will allow it to be detected. The 
labelling may be direct by chemically or passively 
attaching for example a radio label, magnetic resonant 
species, particle or enzyme label to the peptide; or 
indirect by attaching any form of label to a molecule 
which will itself react with the peptide. The chemistry 
of bonding a label to the peptide can be directly through 
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magnetic resonance or enzyme measurement) . Addition of 
either viral peptide or antibody in a sample results in 
restriction of the interaction of the labelled pair and 
thus in a different level of signal in the detector. 

A suitable assay format for detecting HCV antibody is 
the direct sandwich enzyme immunoassay (EIA) format. A 
peptide is coated onto microtitre wells. A test sample 
and a peptide to which an enzyme is coupled are added 
simultaneously. Any HCV antibody present in the test 
sample binds both to the peptide coating the well and to 
the enzyme-coupled peptide. Typically, the same peptide 
are used on both sides of the sandwich- After washing, 
bound enzyme is detected using a specific substrate 
involving a colour change. A test kit for use in such an 
EIA comprises: 

(1) a peptide, as herein defined labelled with an" enzyme; 

(2) a substrate for the enzyme; 

(3) means providing a surface on which a peptide is 
immobilised ; and 

(4) optionally, washing solutions and/or buffers. 

It is also possible to use IgG/IgM antibody capture 
ELISA wherein an antihuman antibody is coated onto 
microlitre wells, a test sample is added to the well. Any 
IgG or IgM antibody present in the test sample will then 
bind to the anti-human antibody. A peptide of the present 
invention , which has been labelled, is added to the well 
and the peptide will bind to any IgG or IgM antibody which 
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other HCV genotypes. Accordingly/ it is clearly desirable 
to supplement testing for HIV-1 with testing for all other 
genotypes, for example, types 2, 3 and 4, and also any 
further genotypes that may be discovered. 

To test for a spectrum of genotypes, there may be 
provided a series of assay means each comprising one or 
more antigenic peptides from one genotype of HCV, for 
example, a series of wells in a microtitre plate, or an 
equivalent series using the bead format. Such an assay 
format may be used to determine the genotype of HCV 
present in a sample. Alternatively, or in addition, an 
assay means may comprise antigenic peptides from more than 
one genotype, for example, a microwell or bead may be 
coated with peptides from more than one genotype. 

It has been found advantageous to use more than one 
HCV antigen for testing, in particular, a combination 
comprising at least one antigenic peptide derived from the 
structural region of the genome and at least one antigenic 
peptide derived from the non-structural region, especially 
a combination of a core antigen and at least one antigen 
selected from the NS3 , NS4 and NS5 regions. The wells or 
beads may be coated with the antigens individually. It 
has been found advantageous, however, to fuse two or more 
antigenic peptides as a single polypeptide, preferably as 
a recombinant fusion polypeptide. Advantages of such an 
approach are that the individual antigens can be combined 
in a fixed, predetermined ratio (usually equimolar) and 
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The peptide of the present invention may be 
incorporated into a vaccine formulation for inducing 
immunity to HCV in man. For this purpose the peptide may 
be presented in association with a pharmaceutically 
acceptable carrier. 

For use in a vaccine formulation, the peptide may 
optionally be presented as part of an hepatitis B core 
fusion particle, as described in Clarke et al ( Nature , 
1987, 330 , 381-384), or a polylysine based polymer, as 
described in Tarn ( PNAS , 1988, 85, 5409-5413), 
Alternatively, the peptide may optionally be attached to a 
particulate structure, such as lipsomes or ISCOMS. 

Pharmaceutically acceptable carriers include liquid 
media suitable for use as vehicles to introduce the 
peptide into a patient. An example of such liquid media 
is saline solution. The peptide may be dissolved or 
suspended as a solid in the carrier. 

The vaccine formulation may also contain an adjuvant 
for stimulating the immune response and thereby enhancing 
he effect of the vaccine. Examples of adjuvants include 
: .uminium hydroxide and aluminium phosphate. 

The vaccine formulation may contain a final 
-;entration of peptide in the range from 0.01 to 
i '/ml, preferably from 0.03 to 2 mg/ml . The vaccine 
ft .ilation may be incorporaterd into a sterile container, 
wh. '. is then sealed and stored at a low temperature, for 
exa: le 4°C, or may be f reeze-dried . 
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S EOUENCE LISTING ( FIGURE 1 and la) 



SEQ ID NO 



SEQUENCE TYPE : Nucleotide cDNA sequence 



SEQUENCE LENGTH : 194 base pairs 



STRANDEDNESS : single 



TOPOLOGY 



linear 



MOLECULAR TYPE : genomic DNA from HCV types 1-3 



ORIGINAL SOURCE 



ORGANISM 



human blood samples 



USE 



DEPOSIT 



FEATURES 



Bases - 255 to -62 of the 5' non-coding region showing 
variations in cDUA sequence between HCV types ,1-3 < 
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SEQUENCE LISTING (FIGURE 5) 

SEQ ID NO : 3 

SEQUENCE TYPE : deduced peptide sequence 

SEQUENCE LENGTH : 57 amino acids 

STRANDEDNESS : single 

TOPOLOGY : linear 

MOLECULAR TYPE : HCV peptide 

ORIGINAL SOURCE 

ORGANISM : human blood samples 

USE : 

DEPOSIT : 
FEATURES 

Amino acids 1577 to 1633 of NS-3 region showing variations 
in peptide sequence between HCV types 1-3. 
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SEQUENCE LISTING (FIGURE 9a) 

SEQ ID NO : 5 

SEQUENCE TYPE : Nucleotide sequence 

SEQUENCE LENGTH : 3 67 base pairs 

STRANDEDNESS : single 

TOPOLOGY : linear 

MOLECULAR TYPE : HCV genomic DNA 

ORIGINAL SOURCE 

ORGANISM : human blood samples 

USE 

DEPOSIT : 
FEATURES 

Bases 4911 to 5277 of NS-4 region of HCV-3 showing 
individual variations and consensus sequence, ^ 
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SEQUENCE LISTING (FIGURE 10a and 10b) 

SEQ ID NO : 7 

SEQUENCE TYPE : peptide sequence 

SEQUENCE LENGTH : 9 0 amino acids 

STRANDEDNESS : single 

TOPOLOGY : linear 

MOLECULAR TYPE : HCV peptide 

ORIGINAL SOURCE 

ORGANISM : human blood samples 

USE : 

DEPOSIT : 
FEATURES 

Amino acids 1679 to 1768 of NS-4 region of HCV1-3 showing 
individual variations and consensus sequence. 
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SEQUENCE LISTING (FIGURE 13) 



SEQ ID NO 



SEQUENCE TYPE : Nucleotide cDNA sequences 



SEQUENCE LENGTH : 30, 70 and 32 base pairs 



STRANDEDNESS : single 



TOPOLOGY. 



linear 



MOLECULAR TYPE : genomic DNA from HCV types 1 to 4 



ORIGINAL SOURCE 



ORGANISM 



: human blood samples 



USE 



DEPOSIT 



FEATURES 



Bases -245 to -216; -185 to -116; and -101 to -70 in the 
5 'NCR region of HCV types 1-4.. 
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CLAIMS 

1. A polynucleotide sequence unique to hepatitis C virus 
type 3 or 4 (HCV-3 or HCV-4) . 

2. A polynucleotide sequence according to claim 1 which 
is a cDNA sequence . 

3. An HCV-3 or HCV-4 specific polynucleotide sequence 
from the core, NS3 , NS4 or NS5 regions. 

4. An HCV-3* or HCV-4 specific polynucleotide sequence 
comprised within the sequences shown in Figures 1 and la, 
9a, 13 and 15a. 

5. An HCV-3 or HCV-4 polynucleotide sequence which codes 
for an antigenic peptide. 

V n • , V 

6. An HCV-3 or HCV-4/polynucleotide sequence which codes 

for C-100, 5-1-1, C33 or C22 peptides having sequence 
alterations unique to HCV-3 or HCV-4 . 

7. An HCV-3 polynucleotide sequence encoding the 
polypeptide 

KPALVPDKEVLYQQYDEM or the polypeptide 
ECSQAAPYIEQAQVIAHQF 
or polypeptides of substantially equivalent antigenicity. 
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16. An antigenic NS4 peptide specific to HCV-4 which 
comprises the HCV-4 sequence substantially from position 
1691 to 1708 or the sequence substantially from position 
1710 to 1728. 

17. A fusion peptide which comprises at least two of the 
antigens of claims 9 to 16. 

18 . A fusion peptide according to claim 17 which 
comprises at least one of the antigens of claims 9 to 16, 
fused to B-galactosidase, GST , trp E, or polyhedriri coding 
sequence . 

19 . A peptide according to any of claims 9 to 18 which is 
labelled. 

20. A vaccine formulation which comprises an antigen 
according to any of claims 9-18 . 

21. Antibody to the antigenic peptide of any of claims 9 
to 19. 

22. An immunoassay device which comprises a solid 
substrate having attached thereto antigen according to any 
of claims 9 to 19 . 

23. A device according to claim 22 for HCV screening 
which comprises a mixture of antigens. 
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30. A method of in vitro HCV typing which comprises 

- carrying out endonuclease digestion of an HCV- 
containing sample employing ScrFI or Haelll/Rsal 
endonuclease; and 

- comparing the restriction patterns with 
characteristic type-specific patterns, 

31. A method according to claim 3 0 or 31 which further 
employs Hinfl in a separate or combined digestion. 

32. A method of in vitro HCV typing which comprises 

- carrying out endonuclease digestion of an 
HCV-containing sample employing ScrFI endonuclease, 
the restriction pattern being characteristic of 
HCV-1, HCV-2 and HCV- 3 ; 

- carrying out endonuclease digestion employing Hinfl 
endonuclease, the restriction pattern being 
characteristic of HCV-4 . 

33. A method of in vitro screening a sample for HCV 
antibodies which comprises; 

- carrying out an immunoassay employing an antigenic 
peptide according to any of claims 1 to 19 or mixture 
thereof; and 

- detecting any antibody - antigen complex produced. 
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FIGURE 6 
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FIGURE 9a 



NUCLEOTIDE SEQUENCES 



4911 
» 

T0040 GACACACCCU 
T003a 7 7CACACCCC 
T003S ? 7CACACCCC 
T002S ? 7 7 7CACCCC 
TIT 37 7 7 7 7CACCCC 



GUCACAAAAU 
GUCACAAAAU 
AUCACAAAAU 
AUCGCGAAAU 
AUCACAAAAU 



ACAUCAUGGC 
ACAUCAUGGC 
ACAUCAUGGC 
ACCUCAUGGC 
ACGUCAUGGC 



AUGCAUGUCA 
AUGCAUGUCA 
AUGCAUGUCA 
AUGUAUGUCA 
AUGCAUGUCA 



GCUGAUCUGG 
GCUGAUCUGG 
GCUGAUCUGG 
GCUGAUCUGG 
GCUGAUCUGG 



4970 
▼ 

AAGUAACCAC 
AAGUAACCAC 
AAGUAACCAC 
AAGUAACCAC 
AAGUAACCAC 



Cans GACACACCCU 

4971 
▼ 

TO 040 CAGCACCUGG 
TO 0 3 3 CAGCACCUGG 
TO 03 5 CAGCACCUGG 
TO 02 5 CAGCACCUGG 
T1737 



GUCACAAAAU ACAUCAUGGC AUGCAUGUCA GCUGAUCUGG AAGUAACCAC 

5030 



GUGUUGCUUG 
GUGUUGCUUG 
GUGUUGCUUG 
GUGUUGCUUG 
GUGUUGCUUG 



GAGGGGUCCU CGCGGCCCUA GCGGCCUACU GCUUGUCAGU 

GAGGGGUCCU CGCGGCCCUA GCGGCCUACU GCUUGUCAGU 

GAGGGGUCCU CGCGGCCCUA GCGGCCUACU GCUUGUCAGU 

GAGGAGUCCU CGCUGCCCUA GCGGCCUACU GCUUGUCAGU 

GAGGGGUCCU CGZGGCZCUX GCGGCCUACU GCUUGUCAGU 



Cans 



Iz^ZCZGG GUGUUGCUUG GAGGGGUCCU CGCGGCZZVX GCGGCCUACU GCUUGUCAGU 



5031 



TO 040 CGGCUGCGUU GUGAUUGUGG GUCAUAUUGA 
^0 03 3 CGGCUGCGUU GUGAUUGUGG GCCAUAUUGA 
^0035 CGGCUGCGUU GUGAUUGUGG GCCAUAUUGA 
^0 02 5 CGGCUGCGUU GUGAUUGUGG GUCAUAUUGA 
Ti 787 CGGCUGCGUU GUGAUUGUGG GUCAUAUUGA 



5090 

GCUGGGGGGC AAGCCGGCA AUCGUUCCAGA 
GCUGGGGGGC AAGCCCGCA CUCGUUCCAGA 
GCUGGGGGGC AAGCCGGCA CUCGUUCCAGA 
GCUGGGGGGC AAGCCAGCA CUCGUUCCAGA 
GCUGGGAGGC AAGCCGGCA CUCGUUCCAGA 



Cans CGGCl 
5091 



IGUU GUGAUUGUGG GUCAUAUUGA 



T0040 
T0033 
TO 03 5 
T002 5 
T1737 



CAAAGAGGUG 
CAAAGAAGUG 
CAAAGAGGUG 
CAAAGAGGUG 
CAAGGAGGUG 



UUGUAUCAAC 
UUGUAUCAAC 
UUGUAUCAAC 
UUGUAUCAAC 
UUGUAUCAAC 



AAUACGAUGA 
AAUACGAUGA 
AAUACGAUGA 
AAUACGAUGA 
AAUACGAUGA 



GAUGGAGGAG 
GAUGGAGGAG 
GAUGGAGGAG 
GAUGGAGGAG 
GAUGGAGGAG 



AAGCCGGCA AUCGUUCCAGA 

5150 
▼ 

UGCUCGCAA GCUGCCCCAUA 
UGCUCGCAA GCCGCCCCAUA 
UGCUCGCAA GCCGCCCCAUA 
UGCUCGCAA GCCGCCCCAUA 
UGCUCGCAA GCCGCCCCAUA 



Cans 



515: 



UUGUAUCAAC AAUACGAUGA GAUGGAGGAG UGCUCGCAA GCUGCCCCAUA 

5210 



TOO 40 
TO 03 3 
T0036 
T0025 
T1737 



UAUCGAACAA 
UAUCGAACAA 
UAUCGAACAA 
UAUCGAACAA 
UAUCGAACAA 



GCUCAGGUGA 
GCUCAAGUAA 
GCUCAGGUAA 
GCUCAGGUAA 
GCUCAGGUAA 



UAGCCCACCA 
UAGCCCACCA 
UAGCCCACCA 
UAGCCCACCA 
UAGCCCACCA 



GUUCAAGGAG 
GUUCAAGGAG 
GUUCAAGGAG 
GUUCAAGGAG 
GUUCAAGGAG 



AAAGUCCUu 
AAAGUCCUU 
AAAGUCCUU 
AAAGUCCUU 
AAAGUCCUU 



GGAUUGCUGCA 
GGAUUGCUGCA 
GGAUUGCUGCA 
GGAUUGCUGCA 
GGGUUGCUGCA 



Cons UAUCGAACAA 



5211 



GCUCAGGUGA UAGCCCACCA GUUCAAGGAG AAAGUCCUU GGAUUGCUGCA 

5270 



T0040 
T0033 
T0035 
TOO 2 6 
T1737 



GCGGGCCACC 



GCGAGCCACC 
GCG^JGCCkCZ 



CAACAACAAG 
CAACAACAAG 
CAACAACAAG 
CAACAACAAG 
C AACAAC AGG 



CUGUUAUUGA 
CUGUUAUUGA 
CUGUUAUUGA 
CUGUCAUUGA 
CUGUCAUUGA 



GCCAAUAGUA 
GCCCAUAGUA 
GCCAAUAGUA 
GCCCAUAGUA 
GCCCAUAGUA 



GCUACCAAC 
GCUACCAAC 
GCUACCAAC 
GCUACCAAC 
GCUACCAAC 



UGGCAAAAGCU 
UGGCAAAAGCU 
UGGCAAAA? 7 7 
UGGCAAAA7 7 7 
UGGCAAAAGCU 



Cans GCGAGCC 



IzZ CAACAACAAG CUGUUAUUGA GCCCAUAGUA GCUACCAAC UGGCAAAAGCU 



5271 



T0040 
T0033 
T0036 
T0025 
T1737 



UGAGACC 
UG^GCCZ 
7777777 
7777777 
UGAGGCU 



Cans UGAGACC 
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FIGURE 9b 

DKBOC2D AMINO ACID SSQUZSCZS 



1787 

1638 

mnntn ™tivp vvtmi CMSAD LEVTT STWVL LGGVL AALAA YCLSV GCWI VGKIE 

SSS™ K^IMA* CMSAD LEVTT STWVL LGGVL AALAA YCLSV GCWI VGHIE 
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FIGURE 11a 
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